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Since the passage of the Clean Air Act in 1970^ several epide • 
miological studies have attempted to associate morbidity 
with indoor and outdoor exposure to nitrogen dioxide (NO 2 ). 
The indoor, so-called gas stove studies 1 ' 7 produced mixed 
and inconclusive results in their attempts to link health 
impairment to the presence of a gas stove or gas hea ter in the 
home. Studies of the health effects of outdoor NO? expo¬ 
sures also have failed to find consistently significant health 
affects at ambient exposure levels. 1 " 11 

In an analysis of people living in Chattanooga, Tennessee, 
conducted under the Community Health and Environmen¬ 
tal Surveillance System (CHESS) program; Shy and Love 12 
were able to link N0 3 exposures and acute respiratory dis¬ 
ease. However, several problems have been raised about this 
study. The researchers have bean criticized for uaing rudi¬ 
mentary statistical techniques, consisting mainly of pairwise 
comparison of illness incidence rates in subpopulstions. In 
addition, the data base has been tainted by its association 
with the controversial CHESS program. 13 (The earlier 
CHESS Chattanooga studies were also criticized for using s 
subsequently discredited method (Jacobs-Hochheieer) for 
monitoring NO? concentrations. However, by 1972 the Salu- 
man technique was being used.] Yet EPA has found the 
Chattanooga data to be accurately transferred from the sur¬ 
veys to the computer tapes and our own research has re* 
vealed the data quality to be at least as high as other similar, 
but much less controversial data bases. 

The lack of persuasive epidemiological studies upon which 
To base a national ambient air quality s tandar d for nitrogen 
dioxide motivate the present paper. Here we return to the 
CHESS aerometric and health data bases collected during 
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1972-73 in Chattanooga and used by Shy and Love to exam¬ 
ine the relationship between NO? and acute respiratory dis¬ 
ease in children. We first describe and defend the CHESS- 
Chattanooga data base and the statistical model! used to 
examine it. We then present our mult* and discuss a num¬ 
ber of econometric issues and their relationship to our find¬ 
ings: 

The Data ftaag 

In January 1972, a self-administered survey on chronic 
respiratory disease (CRD) was distributed! to families with 
children in elementary schools in one of the three ChatUr 
noogi communities, Harrison, Brainerd, and Redbank, lo¬ 
cated within one mile of an air pollution monitoring station. 
A subsample of families—1970 parents and their children, 
4698 individuals in all—was drawn from this sample to par- 
ticipste in an acute respiratory disease (ARD) panel survey. 
Information was taken in two-week intervals (always begin¬ 
ning on a Sunday) over three school semesters from spring 
1972 through spring 1973. Each family was phoned within 
several days after the end of each two-week period to deter¬ 
mine if any family members experienced various acute respi¬ 
ratory disease symptoms or consulted a physician. 

Aerometric data were gathered at seven sites. Hourly mea¬ 
surements of NO? were taken using the Saltzman chemilu¬ 
minescence technique only for the fall 1972 and spring 1973 
study periods. Thus, we eliminated data for the spring 1972 
period from our analysis, Chattanooga was chosen as a site 
for an NO? study because it featured a TNT plant emitting 
large quantities of nitrogen-based pollutants. This plant 
closed January 1,1973, resulting in reduced NO? concentra¬ 
tions in the nearby communities, Daily readings were taken 
on particulates, nitrates, and sulfates. These daily readings 
were reduced to monthly frequency distributions. Unfortu¬ 
nately, the original daily data were unavailable from EPA. 
and we have been forced to use the monthly frequency distri¬ 
butions for the latter three pollutants. 

The Chattanooga health and aerometric data collection 
effort of the early 1970s and the CHESS program in general 
have been criticized (Rbth 14 ) for their poor survey protocols, 
health date inconsistencies, and aerometric data unreliabili¬ 
ty Krupnick and Harrington 15 provide a complete reanaly¬ 
sis of these data and find, first, that the survey protocols 
were carefully designed and observed^ In addition, responses 
to identical sociodemographic questions on the CRD and 
ARD surveys were found to be quite consistent. Also, the 
NO 2 monitoring data were found to be reasonably complete, 
generally consistent, and taken by devices that generally 
outperform other types of monitors in the lab. 

Further, because a duplicate CRD survey was adminis¬ 
tered to some of the participant* 22 months later, we were 
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able to identify inconsistencies inttrumporaliy Drawing on 
responses from 948 parents and considering only the ques¬ 
tions concerning age, race, birthdate, education, and smok¬ 
ing status, over 80% of the parents had matching response* 
over the two surveys. These results compare favorably to 
similar investigations of tha US. Census and other highly 
regarde d data baaaa. 14 * 17 

IrTthe courts of our a lamination of the quality of this dau 
base we noted a number of recall-related problems inherent 
in the survey proCedura. Theta problems may alto be present 
in parts of other surveys, such as the Health Interview Sur¬ 
vey (HIS); that rely on biweekly interviews to collect acute 
respiratory disease data. These problems are discussed in 
some detail elsewhere, 1 * but the main points may be summa¬ 
rized as follows: 

1) Respondents have imperfect recall of the day or even 
week of onset of illness. For example, over 60% of illnesses 
reported were said to have occurred in the second week of 
the recall period] a result significantly (a » 0.01) different 
from the uniform distribution one would expect Apparently 
respondents either forget (presumably minor) illnesses oc¬ 
curring in the first week, or they remember disease onset as 
occurring later than was actually the case. 

2 ) When average duration of reported illness is plotted as 
a function of day of onset during the two-week period, a 
linear decline is found during the second week of the period, 
with average duration at the end of the week barely half of 
average d uration at the beginning. The most likely explana¬ 
tion is that illnesses extending past the end of the period are 
not reported accurately, even though interviewers were in¬ 
structed to identify such illnesses and ask about them at the 
end of the next reporting period. If this explanation is cor¬ 
rect. the truncation of restricted activity days imparts a 
downward bias to illness severity. 

3) In other panel studies it has been suggested that re¬ 
spondents may, overtime, progressively under-report illness 
simply because they become tired of doing interviews. If 
pollution levels are time-dependent, the study results may 
be biased accordingly We found little evidence of this phe¬ 
nomenon. On the assumption that less serious illnesses could 
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Variable 

Mess value or 
populs ti on fraction 

NEWILL 

0.13 

RADS 

0.21 

AGE 

7.7 

Age distribution 

0-2 

0.09 

3-4 

0.08 

5-6 

0.16 

7-8 

0.23 

9-10 

0.25 

11-12 

0:19 

RACElW 

0.91 

CHESTINF 

0.28 

CHRON 

0.07 

Education of household head 
High school graduate 

0.71 

Attended some college 

0.45 

MOMHEAD 

0.05 

Mother* 4 smoking status 

Current 

0.32 

£x- 

0.15 

Nom 

0.53 

CROWD 

1.30 

SEXlF 

0i48 

GAS 

0J05 

RAIN 

2:70 

TEMP 

18.20 


Table II Pollution statistics. 



Mean 

Standard 

deviation 

CorreUtion coefficient, 
PAR90P SUL90P TEMP 

N02MAX 

98.0 

483 

-0.10 

0.20 

-009 

PAR90P 

1006 

314 


—0027 

034 

SUL90P 

10.0 

2.7 



-0036 


be more likely to be neglected: wr regressed the ratio of 
“serums’* to total illness incidence on time, and found no 
trend. 

These findings affected our subsequent dau analysis in 
two ways. First no attempt was made to use time intervals 
shorter than two weeks, even though the sample could be 
reduced to weekly or even daily observations. Second, we 
concentrated on the incidence of illness rather than dura¬ 
tion, inasmuch as we felt the former to be more reliable. 


tha Model 


To identify the factors affecting reported children's dis¬ 
ease, we use pooled cross-section time series models predict¬ 
ing illness incidence or duration ss a function of demograph¬ 
ic, pollution, and weather variables. Symbolically, the mod¬ 
els are of the form 


where S lJt 


X„ 

p,< 

w, 


Sift m /(Xi;, Pyf, W t ) 4 €,y| 

is the reported incidence or duration of the ill¬ 
ness of the ith child in the jxh neighborhood 
during period t , 

is a vector of personal variables for the ith child 

in theytb neighborhood, 

is a vector of pollution variables for the >th 

neighborhood in period t t 

is the weather in period r, and 

is the disturbance term. 


The independent variables are defined as follows: 

AGE: the child’s age at the beginning of the school 

year. 

RACElW: the race of the head of household 11 if white.O 

if nonwhite. 

CHESTINT: 1 if the child has suffered a respirstory infec¬ 
tion within the past three years. 0 otherwise: 


CHRON: 


EDU: 

MOMHEAD: 

SMKPPD: 

CROWD: 


1 if the child suffers from asthma or a chronic 
heart or lung condition, 0 otherwise, 
the years of schooling completed by the head 
of household. 

1 if the household head is female, 0 otherwise, 
mother's smoking in packs per day, 
number of household members divided by 
the number of rooms in the house. 


SEXlF. J sex of child; 1 if female, 0 if male. 

GAS: 1 if the kitchen stove is gas. 0 if electric. 

RAIN: amountof rainfall during the period, in inch * 


EPIDEM: 

TEMP: 

N02MAX: 

PAR90P: 

SUL90P: 


es. 

monthly influenza cases reported by the 
State of Tennessee (in thousands), 
the absolute difference between the average 
temperature during the period and 65*. 
average daily maximum concentration of 
NG 2 , in Mg/m 3 . 

90th percentile total suspended particulate 
concentration during the month, in Mg/m 3 . 
90th percentile sulfite concentration during 
the month, in jig/m 3 . 


As noted above, two dependent variables are considered: 
illness incidence (NEWILL), which is 0 or 1 according to 
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Table III. Predicting illness incidence in population subsampie* * 


* 

A 

B 

c 

D 

Children 

E 

F 

G 

H 

I 


Alii 

Children 

Children 

without 







children 

with 

with 

chrome 

Children 

Children 





12 and 

nonsmoking 

smoking 

respiratory 

with 

6 and 


Fall only 



under 

mothers 

mother* 

disease 

CRD 

under 

Infants 

Spring only 

Intercept 

TK55TT 

0.064 

-0.0197 

0052 

0.098 

0.037 

0.1511 

0.161 

0J016 

(00503) 

(0.059) 

(0.091) 

(0.056) 

(0.095) 

(0.10) 

(0.21) 

(0.113) 

(0.075) 

N02MAXL 

-8.87,F< 

—6.10E-4 

-13.5E-4 

—6.4 E-4 

—13:6E-4 

—8 44E.4 

-12.5E-4 

0.26E-4 

-ISjOE-4 


<2.05E4»' 

(2.46E-4) b 

(3.71E-4)' 

(2.3E-49* 

(4.0E-4)' 

<4.lE-4t b 

<8:7E-4) 

(4.3E-4) 

(2-9E*4!)‘ 

N02MAXH 

1.71E-4 

2.19E-4 

-57E-4 

2.3E-4 

•51E-4 

2 41E-4 

5.2E-4 

0.06E-4 

395E-4 


(0.68E.4l b 

<0.78E-4!l t 

(1.46E-4) 

<aL13E-4!> k 

(1.34E-4) 

(1.89 E-4) 

(2.55E-4) k 

(0 78E-4) 

(2.0IE-4) b 

PAR90P 

B.88E-4 

6 69E-4 

14.3E-4 

11.4E-4 

4.00E-4 

12.1 E-4 

13.0E-4 

28.1E-4 

134E-4 


(6.08E-4) 

(7 4E-4) 

(10.8E-4) 

(6.8E-4) 

(12E-4) 

01.9E-4) 

(24E-4) 

(15E-4) 

(9.1E-4) 

PAR90P2 

—0.053E-4 

—0.044E *4 

—OJ075E-4 

-0.058E-4 

-0:04 IE-4 

-0.066E-4 

0045E-4 

—0.146E-4 

—0J056E-4 


(0.02SE-4) 1 

(0.03 Em 

(0.048E-4) 

(0.031E-4) 

(0.0555) 

(0.054) 

(0.11E-4) 

(0:074£-4) 

(0.039E-4) 

SUL90P 

135E-4 

1I7E-4 

191 E-4 

89.8E-4 

231E-4 

128E-4 

219E-4 

-156E-4 

95.3E-4, 


<45.9E-4>< 

(64E-4» b 

(87E-4) b 

(53E-4) 

(87.8E-4)' 

(90E-4) 

(l»SE-4) 

(217E-41 

(65.7E-4) 

SUL90P2 

-5.54E-4 

-4.81E-4 

-7.91 E-4 

-3.67E-4 

—9.36E-4 

—5.46E-4 

-U.6E-4 

6.76E-4 

-3:38E-4, 


(1.72E-4>' 

(20E-4>‘ 

(3.32E-4) b 

(1.97-41 

(3.24E-4)' 

(3.3E-4) 

(7.5E-4) 

(9J9E-4) 

(2.2E-4) 

AGE 

—00135 

-0.0165 

-0:0252 

-0022 

-0:010 

d 

e 

-0.0207 

-0:0164 


(0.00341‘ 

(0.0042 r 

(0.0057,)' 

(0.0037)' 

(0.00711) 



(0.0047)' 

(0.0048)' 

AGE2 

7.53E*4' 

514E-4 

14.9E-4 

8.99E-4 

3J32E-4 

d 

e 

8.71E-4 

6.4E-4 


(2.44E-4)' 

(3.0E-4) 

(4.2E-4I* 

(2.66E-4)' 

(5.27E-4!) 



(3:47E^t* 

(3.46E-4) 

CHEST! NF 

0.0475 

0.0495 

0:044 



0.071 

0.103 

0.046 

0:048 


(0:010 k 

(0012)' 

(0.018)' 



(0.019 )f 

(0:042 ) b 

(0:014)' 

(0 015)' 

CHRON 

0 044 

0.0390 

0J052 



0.036 

0.003 

0.036 

0:052 


<0:0059 K 

(0.0071)' 

<0.01!ll)' 



(0.011)' 

(0:024) 

(0.0083)' 

(0.0084)' 

CROWD 

0.0184' 

0.0163 

OJ018 

0.019 

0.011 

0.037 

0.029 

0.022 

0:0156 


I0J00T3 ) b 

(0.0090) 

(0013) 

(0.0083 ) b 

(0.0141) 

(0.-017)** 

(0.715) 

<0:010) b 

(0.010) 

EDV 

-0.0012 

-0 0045 

0:0056 

-00016 

-0:0060 

-0.0029 

-0.0031 

-0.00028 

—0i0024 


(0.0021) 

(0.0028) 

(0.0039) 

(0.0025) 

(0.0095) 

(0.0046) 

(-0.315) 

(0.0031) 

(0.0032) 

EPIDEM 

0.072 

0.089 

0036 

0.055 

0:099 

0.064 

0.059 

-0.23 

0:098 


(0.011)? 

(0 013)' 

(0.019) 

(0.012)' 

(0.021!)' 

(0.022)' 

(1.31) 

(00851' 

(0.017,)' 

SEXlF 

0 0076 

0.0083 

0.006 

0.0066 

OjOU 

-0.0080 

0.0027 

0.0021 

0:012 


<0.0052) 

(00063) 

(0.0092) 

(0.0058) 

(0.010* 

(O.Oll)i 

(0:023) 

(0.0072) 

(0.00711) 

SMKPPD 

-0.0013: 


0.0019 

-0.0027 

0J0021 

-0.0050 

-0.0058 

0.00126 

-00037 


(0.0020) 


(0.0049) 

(0.0023) 

(0.0036) 

(0:0040) 

(0:0086) 

(0.0028) 

(0.0028) 

GAS 

—0 020 

-0.044 

0.012 

0.0044 

-0.061 

-0.057 

-0.070 

-0.0012 

-0:036 


(0.012) 

(0:015)' 

(0.019) 

(0.014) 

(0.021)' 

<0:026 ) b 

(0.061) 

(0.016) 

(0.016 ) b 

RAIN 

-0 0056 

-0.0068 

-0.0027 

-0.0050 

-0.0060 

-0 0054 

0.0019 

-0.019 

-0.0027 


<0.0016 i c 

(0:0019)' 

(0.0027) 

(0.0017)' 

(0.0031) 

(0.0032) 

(0:0064) 

(0.0053)' 

(0.0020) 

TEMP 

0 0022: 

0.0021 

0.0025 

0.0020 

0:0026 

0.0029 

0.0015 

0.0018 

0.0026 


(0.0004UK 

(0:00046)' 

(0.00072)' 

(0.00045)' 

(0.00081)' 

(0:00082)' 

(0:0018) 

(0.00086 ) b 

(0.00066)f 

RACE1W 

0.056 

0.039 

0.073 

0.051 

0:060 

0.059 

0.087 

0.034 

0075 


(0.0090)' 

(0:012)' 

(0.141' 

(0.0095)' 

(0.019)' 

(0.017 If 

(0039 ) b 

(0 013)' 

(0.013)' 

S 

16474 

111497 

4977 

11557 

5246 

5108 

1387 

8176 

8296: 

F 

25 5 

19-7 

6.83 

15.29 

833 

568 

1.84 

8.37 

22:5 

W 

0 0286: 

0030 

0.033 

0.022 

0:026 

0.027 

0.025 

0.019 

0:049 


* Standard errora in parentheses 
b Significant at the 5% level. 

' Significant at the 1* level] 

4 AGE and AGE2 were replaced by dummy variables AGEONE ( -1 if AGE • 1.0 otherwise) and AGETWO. 
•AGE and AGE2 were replaced by dummy variables AGEONE through AGESIX, 


whether the child it reported ill during the two-week period 
in question; and duration of restricted activity (RADS), 
which takes an integer value between 0 and 14. 

Tables I and II provide descriptive statistics on these 
variables. Note the low number for mean RADS, indicating 
the large percentage of observations with a zero value for this 
variable. Correlation coefficient* between each of the pollut¬ 
ant* and temperature are also provided. Note that correla¬ 
tion* between pollutants are all quite low. We searched for 
more complicated patterns of colUnearity by using the diag* 
nostic tests 1 * provided with the SAS regression package. 
T^ese tests failed to reveal any serious collinearity problems 
involving any of the independent variables. 

We relied primarily on a linear probability model for our 
anal>Tis. using ordinary least squares (OLS) as the estima¬ 
tion procedure, the results of which are presented below. 
However, the OLS model requires a number of assumptions 
of questionable validity for the current problem* We discuss 
later the effects of these assumptions on the outcomes. 
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Rttulta 

Table HI shows the results of the regressions predicting; 
illness incidence. Column A give* the results for the entire 
sample of children aged 12 and under. The remaining col¬ 
umns show results for s number of subpopulations; we used 
these results to examine the stability of the coefficients and! 
to identify populations especially sensitive to the pollution i 
variables. Thus, Columns B and C give results for children of 
mothers who do and do not smoke, and Columns O and E 
give results for children with and without chronic respira¬ 
tory disease or a history of respiratory ailmenta. In Columns 
F and G we examine the illhess incidence in younger chil- 
dren. Finally, in Columns H and 1 we divide the sample into 
fill (October-December 1972) and spring (January-April 
1973!) time periods. 

The specification of the NO; variable was piecewise linear, 
with a break at 100 Mg/m 3 ; This specification was the best of 
all those examined] In Table III N02MAXL and NO-- 
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2MAXH refer, respectively, to average daily maximum con¬ 
centrations below and above 100 Mg/m 3 . A* shown, for NO* 
2MAX concentration! below 100 Mg/m?, illhesa probability 
falls relatively sharply as N02MAX increases. Above 100 
illness probability gradually increases with NO* 
2MAX. but the illness rats at the highest observed two*week 
maximum concentration (3&4 Mf/m 3 ) is less than that at the 
lowest observed concentration (27 Mg/m 3 ). The clinical liter* 
ature gives no reason why a dose*response function should 
have these characteristics. 

In the subsamples (Columns B to I), the above relation¬ 
ship between N02MAX and illness is replicated for children 
with nonsmoking mothers and children without a history of 
respiratory disease. For both preschool age children and 
infanta; the coefficients were similar although not always 
significant. However, for children whose mothers smoke or 
have s chronic respiratory condition, N02MAXH has virtu¬ 
ally no effect. Finally, when only fall periods are examined, 
N02MAX is not related to illness at all. 

These exceptions did not increase our confidence in the 
results. The first two exceptions suggest that a “sensitive 
population** for NO) is healthy older children of nonsmoking 
mothers. If so, perhaps the presence of a chronic condition 
swamps the small NO) effect Likewise, perhaps for children 
exposed to parental smoking an additional NO 2 effect can¬ 
not be detected. One problem with this explanation is that 
we found no adverse effect of mother's smoking on children's 
health. As for the absence of an NO) effect in the fall, we note 
that the prevalence of illness in that season was relatively 
low in any event If the effect of NO 2 is to reduce resistance 
to disease, we might expect to find no NO 2 effect when little 
disease is present in the community. Such speculation not¬ 
withstanding, we Have not found an effect from NO) that is 
supported by clinical evidence or that is present in all popu¬ 
lation subgroups , 

For sulfates and particulates the best fits were obtained 
for the 90th percentile of two-week concentration and a 
quadratic specification, with a positive linear and negative 
square term. The paniculate results were reasonably consis¬ 


tent across subpopulations: but rarely significant at the S'V 
level. Moreover, the various functions were such that the 
effects of particulktes on illness were negative at concentra¬ 
tions below 80-100 Mg/® 3 , which is near the average 90th 
percentile concentration. That is. oven much of the relevant 
range the particulate variable is inversely related to illhess: 

The coefficients for sulfates are significant for the entire 
sample, but not for the fall and spring semesters separately; 
For falli the coefficients enter with signs and reverse of all 
other subaamples, but the t -values are very small! For 
spring, the coefficients are similar to coefficients in other 
equations but the r-values still are not significant For the 
population subaamples the aulfate coefficients are stable 
and significant except for infanta, where, as we have noted, 
sample sixes are much smaller and an effect of population on 
health would be correspondingly more difficult to identify. 
The inconsistent seasonal results may be related to using 
weighted averages of monthly summaries of daily readings 
instead of two*week averages, which were unavailable for 
particulates and sulfates. 

Turning briefly to the other explanatory variables, the 
moat statistically significant and robust results were for vari¬ 
ables that one would expect to be associated with respiratory 
disease: age, a history of chest infection, presenceof a chronr 
ic condition, the extent of crowding in the homeland outside 
temperature. Not only were these variables almost always 
significant, but the coefficients were stable across subpopu^ 
lations. The coefficients for CHRON (presence of chronic 
disease); for example, varied between 0.036 and 0.052, ex¬ 
cept in the equation for infanta, and indeed very few infants 
in the sample were diagnosed for a chronic disease. The 
EPIDEM variable was also generally significant but in the 
fall the sign was negative, a result we believe to be fortuitous, 
inasmuch as the variable was very small in absolute value 
during that season. 

For two other variables, RACElW and!RAIN, the results 
were stable and significant. White children consistently re¬ 
ported more new illness than nonwhites We also found a 
consistent inverse relationship between the amount of rain- 


Table IV. Comparison of specifications of NO? variable* in equations predicting illness incidence " 



A 

B 

C 

D 

E 

F 

G 

N02MAX 

1.4E-6 

-3.5E-4 

—21.7E-4 






(56.8E-6) 

(1.82E-4)* 

<4.37E-4'»< 





N02MAX2 


0.009E-4 

11.3E-6 







(0.0045E-4) 

(2.31E-6)' 





N02MAX3 



-1.59E-8 

<0.35E-8>‘ 





NO2MAXI0-T5I 




-19E-4 

-21E4 







(4.63Er4l» e 

(4.88E-41' 



M02MAX(75-150I 




-0.16E-4 

(0.94E-4I 




N02MAXI75-100) 





-0.28E-4 

(0.80E-4) 



N02MAXI100-150) 





112E-4 

(1.1E-4I 



M02MAX0150) 




1.7E-4 

0.56E-4 







(1.15Et4^ 

(1.19E-4I 



N02MAX(0-100) 






-8.9E-4 

(2.05E-4)' 


N02MAX01001 






17E-4 

(0.69E-4I* 


N02AVG(0-50t 







—9 4E*4 

44 \\E-A)P 

N02AVG050) 







1.96E-4 

(3.8E-4) 

F 

25.8 

24;7 

24.5 

24.0 

23.2 

255 

24 7 

R* 

0J0274 

0.0277 

0.0289 

0.0281 

0.0288 

0.0286 

0.027B 

S 

16474 

16474 

16474 

16474 

16474 

16474 

16474 


* Standard errors in parenthe 

* Significant at the 5% level 

* Significant at the 1% level. 


to 

© 

N 

CJ 

CJ 

•si 

CD 

«sl 

cn 
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Table V. Croat section ume tenet regre***^.. *.v„. 

• illness incidence snd restricted activity, days:.full sample v*. 
sample consisting of one child per family,* 


Variable 

Illness incidence 

Restricted activity days 

— rai 

•ample 

sample 

FuF 

•ample 

5ub- 

aemple 

Intercept 

0.0512 

-00016 

0346 

0.275 


(0.06) 

(0.73) 

(0t35)‘ 

(0.20) 

N02MAXL 

-887E^4i 

-7J8E-4 

—25.QE-4 

-28.7E-4 


(2 05E-4)« 

(19E-4)' 

(56E-4I' 

(8.3E-4)' 

N02MAXH 

H71E-4 

1.39E-4 

4.4E-4 

6.22E-4 


<(168E-4)‘ 

(0.99E-4) 

(1.86E-0* 

(2.8E-4)* 

PAR90P 

8.88E-4 

17.8E-4 

-1B.3E-4 

9.80E-4 


(6.08E.4I 

<86E-4) k 

(16.6E-4) 

(24.SE-4) 

PAR90P2 

-0.QS3E-4 

-0.068E-4 

0.064E.4 

-0.04 E-4 


(0.028E-4) 

(0.040E-4)* (0J075E-4) 

(0.099) 

SUL9QP 

13SE-4 

107Et4’ 

212E-4 

73.3E-4 


(45.9E-4V 

(66E-4) 

(126E-4) 

(1B5E-4) 

SUL90P2 

-5ME-4 

-4.73E-4 

-9.3E-4 

-4.26E-4 


(1.72E.4)' 

(2.SE-4I 

(4.7E-4) b 

(6.9E-4) 

AGE 

—0.0165 

-0020 

-0.045 

-0.044 


(00034)' 

(0.0049)' 

(0.0092V 

(0014V 

AGE? 

7.53E-4 

8.54E-4 

20.9E-4 

19.1 E-4 


(2.44E-4)' 

(3.6E-4)* 

(6.7E-4)' 

(10.0E-4) 

CHEST1NF 

0 0475 

0.053 

0.178 

0.196 


(0.010)' 

<0:015V 

(0.028)' 

(0.042V 

CHHON 

0 044 

0.047 

0:099 

0.110 


(00059 V 

(0.0064 V 

(0.016)' 

(0.023 V 

CROWD 

0.01I&4 

0:018 

0.019 

0.0080 


( 000 : 31 * 

(0.010) 

(0.020) 

(0:026) 

EDU 

—00012 

0.0028 

-0.0087 

-0.0023 


(0.00211) 

(0.0032) 

(0.0061) 

(0.0088) 

EPIDEM 

0 072 

0.080 

0.228 

0221 


(0.011)* 

(0 0151* 

(0.0291' 

(0.042V 

SMKPPD 

—0.0013 

-0.0039 

0;0004 

0.0086 


(0:0020) 

<0 0028) 

(0.0050) 

(0.0078) 

GAS 

-0.020 

-0:0021 

-0.037 

0.0021 


(0.012) 

(0.0016) i 

(0.032) 

(0.045) 

RAIN 

-0:0056 

-00049 

-0.0037 

0:0025 


(0.0016 V 

(0:0022) b 

(0.0043) 

(0.0061) 

TEMP 

0.0022 

0.0020 

0 0029 

0.0030 


(0j00O4) c 

(000057V 

(0.0011V 

(0.0016) 

RACEIW 

0.056 

0.065 

0:127 

0.136 


(0 0090V 

(0.014V 

(0.025 V 

<0:038V 

SEX1F 

0.0076 

0:0094 

0.023 

-0.0039 


(0.0052) 

(00074) 

(0014) 

(0.02:1) 

S 

16474 

8158 

16474 

8158 

F 

255 

13.6 

205 

106 

R 7 

0.0286 

0.031 

0:023 

0.024 


•Standard errors in parentheses 
% Significant at the 5% level 
• Significant at the 1% level 


falli and the incidence of illness in a two-week period, al¬ 
though the magnitude varied by a factor of six between fall 
and spring: Again, we have no explanation for this result. In 
addition, the presence of a gas stove in the house appeared to 
be unrelated to diaeaae incidence. Few significant results 
were obtained, and for those that were significant the sign 
was contrary to expectation. As only 5% of the households 
cooked with gas, thaae generally inconsistent results are not 
particularly surprising. 

Other covarietea had virtually no explanatory power, and 
were rather unstable serosa subpopulktions; educational lev- 
dlbf head of house hold, sex. and mother's smoking status. In 
particular, we found!that a mother's smoking in the home 
tras unrelated to acute respiratory disease incidence of her 
jfchildren. However, this should not be too surprising in view 
r of the contradictory findings on the health effects of passive 
J amoking, 19 

An analysis was also carried out for illness duration as the 
dependent variable In this case the dependent variable S i;< 
took an integer value between 0 and 14* OLS estimates 
predicting illness duration were very simiUr to the rtsulu 
presented above; that is, independent variables that were 
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alsO'Signiucaiu r 

However;, OLS estimates of truncated variables are inconsis¬ 
tent as well as inefficient, 20 so it is especially important to 
compare the results to those of s more suitable estimation 
procedure Thus, illness deration was also investigated using 
Poisson regression; and the comparison between Poisson 
and OLS is discussed below. 

fcoma Srobtsm of Esttmttbn 

A pervasive problem in the estimation of the effects of air 
pollution on illness is that information on personal exposure 
to pollutants is rarely available. Researchers have been 
obliged to uae ambient monitoring data as a proxy for per¬ 
sonal exposure, and our study is no exception to this rult 
Nonetheless, every child in our sample lived and attended 
school within a mile of a monitoring site, a relatively tight 
radius compared to most similar studies. 

Besides this measurement difficulty, there were several 
major econometric problems. These problems arose primari¬ 
ly from our desire to use a linear probability model and OLS 
as the principal estimation procedure. Convenient though it 
may be^ the OLS model requires a number of assumptions of 
questionable validity for the current problem. The question 
we now examine is whether these refinements make much 
difference to outcomes. 

The first problem is that the dependent variable S l}l is 
limited to the values 0 or 1 (for illness incidence) or to the 
small positive integers (for illness duration). Thus, the OLS 
estimators are not efficient, and the Linear probability model 
may not be appropriate in any event. 

A second problem is concerned with the functional form of 
the relationship between illness and air pollution (indeed; 
between illness and any explanatory variable). As there is no 
theory to guide the selection of functional form, we chose a 
functional form on the basis of an information criterion 
proposed by Sawa. 21 

The third problem involves the structure of the distur¬ 
bance term We examined two alternatives to the OLS 
assumption of uncorrelated disturbances: 

Autoregression: an individuals health status in one period 
may affect his or her healthiitatus in subsequent periods, in 
which case £u iyf < l/f 7 * 0 for t * V. 

Contugion: one s health may be affected by the health of 
others, especially family members and classmates, in which 
case E(< l/f v/r) * 0 for i ¥■ i' or / * /. 

These problems were examined sequentially. First, sever¬ 
al alternative functional forms were examined. Having se¬ 
lected a functional form, we then examined the error struc¬ 
ture Finally, alternative estimation procedures more suited 
to limited dependent variables were investigated. 

Functional Fonw 

Table TV showi the relationship between illness incidence 
and NO* for several different specifications of the pollution 
variable. The basic variable was N02MAX, the daily maxi¬ 
mum N0 2 reading, averaged over the two*week period. (Not 
shown are specifications using average pollution variables; 
which give results inferior to the ones for N02MAX.) 

The specifications examined include the following. 

• linear specification 

• quadratic 

• cubic 

• piecewise linear functions with one break point at 100 
Mg/m 3 , two break points at 7S*and 150 Mg/na 3 ; and three 
break points at 75,100 and 150 Mg/m 3 . 

In all specifications, except the linear, the relationship 
between N0 2 and illness incidence is U-shaped: Based on 
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Table Vli Comparison of loyit and OLS models. predicting illness incidence. 


OLS Logit 


Coefficient 

Std. error 

Coefficient 

Std error 

.35 





dr 


Intercept 

0.176 

0.029* 

-1.80 

0 256 


N02MAXL 

-898E-4 

2.04E-4* 

-0.0067 

0.00174* : 

—7.63E-4* 

N02MAXH 

1.6IE 4 

0.68 E-4* 

0j00135 

0.000595* 

1.53E-4 - 

PAR90P 

-2.63E-4 

0.923E-4* 

—*0:00187 

0:00082* 

—2i3E*4 l 

SUL90P 

2.S5E-4 

12.1E-4 

-0:00138 

00108 

-l.57p.a 

AGE 

-0:0062 

0.00085* 

-0:0715 

00074* 

-0.00614* 

CROWD 

0.150 

0.0071* 

0.151 

0064? 

0:0172* 

EP1DEM 

0074 

0:010* 

0.537 

0.087* 

0 061* 

GAS 

—0:020 

0.012 

-0.187 

0.110 

-0.020 

RAIN 

-0.0063 

0.0016* 

-0.051 

0015* 

-0.0058* 

TEMP 

0:00164 

0100039* 

0.0167 

0.00356* 

0.00190* 

RACE1W 

0.0540 

0j0O88* 

0.542 

0.093* 

0.0528* 

CHRON 

0JO416 

00059* 

0.350 

00450* 

0.0423* 

CHESTINF 

0.0460 

00102* 

0342 

0.081* 

0.0434* 


• Significant at the 5% level. 
k Significant at the 1% levaL 


the BIC criterion proposed by Sawa, 21 the beat performer is 
the piecewise linear specification with a break point at 100 
Mf/m 3 . 

Wt alio tested these spline specifications against the quar 
dr a tic specification using one of the tests described by Da¬ 
vidson and MacKinnon 22 for non-nested models. The result 
of this teat was as follows; When the quadratic specification 
was taken as the null hypothesis a«ainst the piecewise linear 
alternative, the null hypothesis was rejected. However, with 
the spline taken as the null hypothesis the null could not be 
rejected. Thus, the spline specification with a break point at 
100 Mg/m 3 fit the data best, and this was used in subsequent 
work. 

Error Stovctura 

To examine the effect of possible serial correlation we 
assumed a first-order autocorrelation scheme and used a 
two-stage procedure described by Kmenta. 23 First we esti¬ 
mated the autocorrelation parameter p using OLS, and then 
reestimated I the model 

(Vi "* p V<-i) m (X t “ + (* r — p<i-i)j 

Our estimate for p wu p * 0.036. In the second stage, we 
found the following results for the NOj variables, which, it 
will be noted, are essentially the same as Column H of Table 
111 : 

S « —9.35E-4 N02MAXL 


with n - 14907 and F - 25.1 for the equation. This result 
indicated that the problem of autocorrelation could be ig¬ 
nored. 

Contagion presented a problem that we were not able to 
resolve fully, due to alack of complete information on all the 
physical contacts among the various members of the sample. 
However, we were able to examine contagion in the home, 
one of the most likely places where diseases may be spread. 

If contagion in the home is present, the estimated effect on 
incidence and duration of variables common to members of a 
family, such as their exposure to air pollutants, will exceed 
the true effect. To test for this possibility we compared 
regression results from the full sample to the results from a 
subset consisting of one child chosen randomly from etch 
family represented in the sample (Table V). Although the 
standard errors on the former are a bit larger, (which is what 
one would expect from the reduction in sample size), the 
coefficients are quite similar. Thus, the results are probably 
not much affected by spread of disease in the home. 


United D epe n dent Variables 

In this section we examine whether the results depend on 
our use of OLS rather than techniques more suited to limited 1 
dependent variables. Specifically, we tested the linear prob¬ 
ability model against a logit model for predicting illness 
incidence, using the “G" test described by Davidson and! 
MacKinnon. 22 This test showed the logit model to be superi¬ 
or in the following sense: When the null hypothesisifo is the 


(2:12E4» + l;76E-4 N02MAXH + other term* 

(0.81E-4) 

Tabu vn. Comparison of Poisson and OLS models predicting Ulness duration. 


OLS _Poisson 


Coefficient 

Std. error 

Coefficient 

Std. error 

as 





dx 


Intercept 

0.352 

0.079* 

-1.62 

0.361* 


N02MAXL 

-0.00249 

0.00056* 

-0.0063 

OJ0023* 

—000173* 

N02MAXH 

0.00043 

0000186* 

0.00195 

0.00084? 

0:00041* 

PAR90P 

-0.0047 

0.00025 

-0.00101 

0.00120 

-00021 

SUL90P 

-0.0026 

0.0033 

-0.0120 

0j016 

-0.0025 

AGE 

-0.0174 

0.0023* 

-0.078 

0:0102* 

-00162* 

CROWD 

0.0089 

0.0193 

0.075 

0.092 

00156 

EPIDEM 

0.216 

0.028* 

0.768 

0.117* 

0160* 

GAS 

-0.032 

0.031 

-0.155 

0.158 

-0030 

RAIN 

-0.0044 

0.0042 

0.0022 

0.0195 

0.00046 

TEMP 

0.0030 

0.0011* 

0.0152 

0.0052* 

00032* 

RAGE1W 

0.119 

0.024* 

0.668 

0.148* 

0.114* 

CHRON 

0.091 

0.016* 

0.399 

0.069* 

0093* 

CHESTINF 

0.178 

0.028* 

0.571: 

0.097* 

0155* 


1 Significant at the 5% level 
k Significant at the 1% level. 


Source: https://wwwjndustrydocuments.ucsf.ec^0^8k?Pjfft<WWy 
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OLS model and the logit model is H\, Ho is rejected; however, 
’when the roles are reversed and //© is the logit model, Ho 
cannot be rejected. , *' 

For illness duration a similar comparison was made be¬ 
tween OLS and Poisson regression. Again, the OLS model 
was found to be inferior, fefooatbeieas. the coefficienu on the 
independ ent vari ables aattmaud using OLS were very simi¬ 
lar to the corresponding coefficients for the logit and Poisson 
modeh. These coefficienu art compared in Tables VI and 
VII. To facilitate comparison, the rightmost column of each 
table is the derivative of the dependent variable of the Pois¬ 
son or logit function, evaluated at the mean of the dependent 
variable (For discrete independent variables the entry is the 
average change in probability of illness, estimated! by the 
weighted sum of the change in probability when the variable 
is added at the mean and when it is taken away.) Even 
though OLS appeared slightly inferior to logit and Poisson 
regression in predicting illness incidence and duration, the 
qualitative result* were hardly affected. 


Conductor* 

A CHESS dau base from Chattanooga. Tennessee was 
thoroughly scrutinized and found to be of high enough quali¬ 
ty to warrant epidemiological analysis. Using this data base, 
the relationship between NO 2 ambient pollution levels and> 
acute respiratory disease in children was examined. Al¬ 
though a statistically significant relationship was found, it 
was not monotonic. Indeed, over the range of pollution val¬ 
ues experienced Lmore illness is associated with Jbw pollution 
values than with high ones, A U-shaped relationship be¬ 
tween illnets and! NO? concentrations was found in several 
aubpopulations in addition to the entire data set, although 
for some subpopulations no relationship was found. As far as 
we know, there is no clinical explanation for this result. In 
contrast, higher ambient sulfate levels were found to have a 
positive effect on acute respiratory disease incidence in chil¬ 
dren over the entire period and for different subsamples, 
although this effect was not significant for either season 
analyzed separately. 

The strange relationship between N0 3 concentrations and 
ARD in children could be attributable to three problems 
inherent in any epidemiological study. First, the relation¬ 
ship could be entirely fortuitous, although the odds against 
this for our study are long Second, both illness and NO 2 
could be related to some unobserved variable. However, such 
a variable must have strange properties, because for certain 
well-defined subsets, iu relationship to eitherilliiess or N0 2 
changes substantially. Finally, the data could still contain 
biases that create the observed effects. 

Im short, there is reason to be skeptical of a U-shaped 
dose-response function relating ambient NO 2 levels and 
acute respiratory disease. Nonetheless, we suggest that non¬ 
monotonic dose-res pons# functions be explicitly considered 
in future epidemiological or clinical research on the health 
effects of N0 2 and perhaps other pollutanu as welli 
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